Nemotron-H-4B-Instruct-128K is a large language model with 4 billion parameters developed by NVIDIA. It uses a hybrid architecture, supports a 128K long context, and is optimized for scenarios such as chatting, instruction following, and tool calls. It supports multiple languages, including Chinese, English, Japanese, etc., a total of 10 languages.
Natural Language Processing
TransformersEnglish